SEQUENCE LISTING 



<110> Dickey, Lynn 

Gasdaska, John 
Cox, Kevin 



<120> Expression of Biologically Active 
Polypeptides in Duckweed 

<130> 40989/267934 

<150> US 09/915,873 
<151> 2001-07-26 

<150> US 60/293,330 
<151> 2001-05-23 

<150> US 60/221,705 
<151> 2000-07-31 

<160> 16 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 554 
<212> DNA 
<213> Zea mays 

<400> 1 



gatcaagtgc aaaggtccgc cttgtttctc ctctgtctct tgatctgact aatcttggtt 60 
StgaScgt tgagLat?t tggggaaagc ttcgtccaca gttttttttt cgatgaacag 120 
tqccgcagtg gcgctgatct tgtatgctat cctgcaatcg tggtgaactt atgtctttta 180 
tatccttcac Lcca^gaaa agactagtaa tctttctcga tgtaacatcg tccagcactg 40 
ctattaccgt gtggtccatc cgacagtctg gctgaacaca tcatacgata ttgagcaaag 3 00 
atctatctfc ccfgttcttt aatgaaagac gtcattttca tcagtatgat ctaagaatgt 360 
?acaacttgc aaggaggcgt ttctttcttt gaatttaact aactcgttga gtggccctgt 420 
t?c?cggacg SaggSttt gctgctccac acatgtccat tcgaatttta ccgtgtttag 480 
caaggfcgal aa^tgcat cttgatgatt tagcttgact atgcgattgc tttcctggac 540 
ccgtgcagct gcgg 



<210> 2 
<211> 498 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Duckweed codon optimized nucleotide sequence 
encoding human alpha -2B interferon 

<221> CDS 



<222> (1) ■ • • (498) 



£c = s = = = = = 1 - = 5 = ? s 

2 2 5 a 2 S S S 2 = 2 = * = 5 = 

20 25 

S 2 C - 5 2 S = = = £ - ~ = £ 2 
2 S si 3 S 2 C 2 = S 2 S 2 2 £ - 

50 55 

aac ct g etc ago ac= aag go a f «oj jc= jo. « gje £9 - ct 9 
Asn Leu Phe Ser Thr Lys Asp Ser Ser Axa Axa P gQ 

7 0 

2 S 2 £ s = a = = = = = 5 5 s 

85 90 

H 5 S S = S S 2 S 2 2 2 2 2 2 2 

100 105 

2 5 S S 2 E S S S 5 2 2 5 S = = 



115 



£ 2 2 S 2 3 5 S S 5 K 5 2 S 2 S 

130 " 5 140 

„„„ ,-t-e aac eta age acc aac etc cag gag age 

S S S S S S £ Kr Le» sir Thr * 8 n «, san 0! U ser 

145 150 I 55 

etc cgc tec aag gag taa 
Leu Arg Ser Lys Glu * 
165 



96 



144 



192 



240 



288 



336 



384 



432 



480 



498 



<210> 3 
<211> 96 
<212> DNA 

<213> Oryza sativa 
<400> 3 

accatgeagg tcctgaacac gatggtcaac 
gtcctcctcg ggctgagcag caacctcacc 



aagcacttcc tctccctgtc cgtcctcatc 60 

96 

geegge 
2 



<210> 4 
<211> 188 
<212> PRT 

<213> Homo sapiens 



<400> 4 
Met Ala 
1 

Lys Ser 



Leu Thr 



Ser Cys 
20 

Gly Ser Arg Arg 
35 

Ser Cys 



Leu Phe 
50 

Glu Phe 
65 

Glu Met 

Ala Ala 

Gin Gin 

Thr Glu 
130 
Tyr Phe 
145 

Cys Ala 
Ser Thr 



Gly Asn 
He Gin 



Trp Asp 
100 
Leu Asn 
115 

Thr Pro 

Gin Arg 

Trp Glu 

Asn Leu 
180 



Phe Ala 
5 

Ser Val 

Thr Leu 

Leu Lys 

Gin Phe 

70 
Gin He 
85 

Glu Thr 



Leu Leu 

Gly Cys 

Met Leu 

40 
Asp Arg 
55 

Gin Lys 
Phe Asn 
Leu Leu 



Val Ala 
10 

Asp Leu 
25 

Leu Ala 
His Asp 
Ala Glu 



Asp Leu 

Leu Met 

He Thr 
150 
Val Val 
165 

Gin Glu 



Glu Ala 
120 
Lys Glu 
135 

Leu Tyr 
Arg Ala 
Ser Leu 



Leu Phe 
90 

Asp Lys 
105 

Cys Val 

Asp Ser 

Leu Lys 

Glu He 
170 
Arg Ser 
185 



Leu Leu Val 

Pro Gin Thr 

Gin Met Arg 
45 

Phe Gly Phe 
60 

Thr He Pro 
75 

Ser Thr Lys 



Phe Tyr Thr 



He Gin Gly 
125 

He Leu Ala 

140 
Glu Lys Lys 
155 

Met Arg Ser 
Lys Glu 



Leu Ser Cys 
15 

His Ser Leu 
30 

Arg He Ser 

Pro Gin Glu 

Val Leu His 
80 

Asp Ser Ser 
95 

Glu Leu Tyr 
110 

Val Gly Val 



Val Arg Lys 

Tyr Ser Pro 
160 

Phe Ser Leu 
175 



<210> 5 

<2H> 165 

<212> PRT 

<213> Homo sapiens 



<400> 5 

Cys Asp Leu Pro Gin 

1 5 
Leu Leu Ala Gin Met 
20 

Arg His Asp Phe Gly 
35 

Glu Thr He 



Thr His Ser Leu 
Arg Arg 
Phe Pro 



Lys Ala 
50 

Asn Leu 
65 

Leu Asp 
Ala Cys 
Glu Asp 
Tyr Leu 



Phe Ser Thr 

Lys Phe Tyr 
85 

Val He Gin 
100 

Ser He Leu 
115 

Lys Glu Lys 



Pro Val 

55 
Lys Asp 
70 

Thr Glu 
Gly Val 
Ala Val 
Lys Tyr 



He Ser 

25 
Gin Glu 
40 

Leu His 



Gly Ser 
10 

Leu Phe 
Glu Phe 
Glu Met 



Ser Ser 
Leu Tyr 



Arg Arg Thr 

Ser Cys Leu 
30 

Gly Asn Gin 
45 

He Gin Gin 
60 

Trp Asp Glu 



Ala Ala 
75 

Gin Gin Leu Asn Asp 
90 

Thr Glu 



Gly Val 
105 

Arg Lys Tyr Phe 
120 

Ser Pro Cys Ala 
3 



Thr Pro Leu 
110 

Gin Arg He 

125 
Trp Glu Val 



Leu Met 
15 

Lys Asp 

Phe Gin 

He Phe 

Thr Leu 
80 

Leu Glu 
95 

Met Lys 



Thr Leu 
Val Arg 



Al, 01° He „at «, « S Sar Leu Ser Thr « Leu Gin Glu « 

145 150 
Leu Arg Ser Lys Glu 
165 



<210> 6 
<211> 31 
<212> PRT 

<213> Oryza sativa 



£?H val Leu Ae„ Thr Met val » Lys His Phe Lau Ser Leu ser 

Va! Lau II. V.! Lau Lau Gly Lau Sar Sar Asn Lau Thr Ala Gly 
20 25 



<210> 7 
<211> 31 
<212> PRT 

<213> Artificial Sequence 



<223> Modified rice alpha-amylase signal peptide 

retain Val Leu Asn Thr Met Val Asn Lys His Phe Leu Ser Leu Ser 

Val Leu lie Val Leu Thr Val Leu Ser Ser Asn Leu Thr Ala Gly 
20 25 



<210> 8 
<211> 21 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 8 T -Mo dv.^ qpr Leu Leu Leu Ser 

Met Lys Thr Asn Leu Phe Leu Phe Leu lie Phe Ser Leu ^ 

Leu Ser Ser Ala Glu 
20 



<210> 9 

<211> 498 

<212> DNA 

<213> Homo sapiens 



<400> 9 

tgtgatctgc ctcaaaccca 
atgaggagaa tctctctttt 
gaggagtttg gcaaccagtt 
cagcagatct tcaatctctt 



cagcctgggt agcaggagga 
ctcctgcttg aaggacagac 
ccaaaaggct gaaaccatcc 
cagcacaaag gactcatctg 
4 



ccttgatgct cctggcacag 60 
atgactttgg atttccccag 12 0 
ctgtcctcca tgagatgatc 180 
ctgcttggga tgagaccctc 240 



ctagacaaat tctacactga act™ cgctgtj jca^JJgo ££££ 300 
cagggggtgg gggtgacaga 9-tcccctg ^gaaggagg ttgtgcctgg 420 

aaatacttcc aaagaatcac tctctatctg | caacaaactt gcaagaaagt 480 

gaggttgtca gagcagaaat catgagatct ttttctttgt c 4g8 



ttaagaagta aggaatga 

<210> 10 

<211> 569 

<212> DNA 

<213> Homo sapiens 

atggccttga cctttgcttt actggtggcc ctcctggtgc tcjjctgcj. gtcaagctgc 60^ 
tcJgtgggct gtgatctgcc tc.aacccac ^ctgggta ^ 
ctggcacaga tgaggagaat "tctcttttc tcctgct g tgtcctccat 240 

tttccccagg aggagtttgg caaccagttc caaaaggctg tgct tgggat 300 

gagatgatcc agcagatctt caatctcttc agcjca gg t cct ggaagcc 360 

gagaccctcc tagacaaatt ctacactgaa ^ctaccagc 9 9 ct ccattctg 420 

?gtgtgatac agggggtggg 99tgacagag actcccctga tgaagg^^ atacagccct 48Q 
gctgtgagga aatacttcca aagaatcact ^tatctga aaca aacttg 54 0 

tgtgcctggg aggttgtcag agcagaaatc atgagatctt tttc a ^ 
caagaaagtt taagaagtaa ggaatgaaa 

<210> 11 
<211> 62 
<212> DNA 

<213> Arabidopsis thaliana 

; 4 4» 9 acta atctttttct ctttctcatc ttttcacttc tcctatcatt at=ctc 9g c= 60 
ga 

<210> 12 
<211> 78 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

< <223 > > Duckwee^codon optimized sequence encoding hGH 

signal peptide 
<223> Duckweed codon optimized hGH signal peptide 

<400> 12 . rc ttc Q qc Ctq etc 48 

S S S 5 £ S S S S S 2 K - S, ~ - 



1 



5 



tgc ctg ccc tgg etc cag gag ggc tec gcg 
Cys Leu Pro Trp Leu Gin Glu Gly Ser Ala 
20 25 



78 



<210> 13 



5 



<211> 26 
<212> PRT 

<213> Homo sapiens 



^AUThr Gly Ser Arg Thr Ser Leu Leu Leu Ma Phe Gly Leu Leu 

1 5 10 

Cys Leu Pro Trp Leu Gin Glu Gly Ser Ala 
20 25 



<210> 14 
<211> 573 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

< 2 2 2 2 2 3> iuckweed'codon optimized sequence encoding mature 
hGH 

<223> Duckweed codon optimized sequence encoding hGH 

sis s s 2 s s Leu 2 s s a s a s 48 



5 



^ rhr rac caa ctq gee ttc gac acc tac cag gag ttc gag 
III Se S -u £ G?n Leu Ala Phe up Thr Tyr Gin Glu Phe Glu 

20 25 

S E 5 s s s a = s - = - = = = s 

40 " 



144 



tar ttc tec qag tec ate ccc acc ccg tec aac cgc 192 
X £ Sr Leu Cys Phe Ser Glu Ser He Pro Thr Pro Ser Aau Ar, 

6 u 



50 



55 



gag gag acg cag cag aag age aac ctg gag ctg etc cgc ate tec ctg 
Glu oS Thr Gin Gin Lys Ser Asn Leu Glu Leu Leu Arg He Ser Leu 

7 0 

2 2 s a s s = a s - = - = 5 i s 

85 90 95 

2 E 2 2 S 2 £ 35 E 2 S 2 2 2 2 5 



288 

Val 

95 

336 



100 



105 HO 



2 S 5 5 2 2 2 Gi? S 2 £ £ 2 5 S = 384 

115 12 0 125 

6 



s 5 s s s s = s = s s s = s s = 432 

130 135 

aag ttB 9 ac acc aac jjc car aac f c g .c g =c etc etc aa g aac tac 
Lys Phe Asp Thr Asn Ser His Asn n&p « * 160 
lis "0 I" 

5 2 S S 5 £ S S 5 2 5 - - = 1 s 

165 170 

etc cgc ate gtg cag tgc egc tee gtg gag ggc tec tge ggc ttc 
Leu Arg He Val Gin Cys Arg Ser yal Glu Gly Ser Cys y 
180 185 



480 



528 



573 



<210> 15 
<211> 191 
<212> PRT 

<213> Homo sapiens 



Phe°Pro 5 Thr lie Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg 
aL His Arg Leu /is Gin Leu Ala Phe Asp Thr Tyr Gin Glu Phe Glu 



20 25 



Glu Ala Tyr lie Pro Lys Glu Gin Lys Tyr Ser Phe Leu Gin Asn Pro 

40 

Gin Thr Ser Leu Cys Phe Ser Glu Ser He Pro Thr Pro Ser Asn Arg 
Glu G°iu Thr Gin Gin Lys Ser Asn Leu Glu Leu Leu Arg He Ser Leu 



65 



Leu Leu lie Gin Ser Tr P Leu Glu Pro Val Gin Phe Leu Arg Ser Val 
Phe ,1, Asn S,r Leu Val Tyr Gly Ala Ser Asp Ser Asn val Tyr Asp 

100 105 



Lea Lea Lys Asp Lea Glu Glu Gly He Gin Thr Leu M.r Gly Arg Leu 

Glu Asp "ly ser Pre Ar g Thr Gly Gin lie Phe Lys Gin Thr Tyr Ser 

Lys 111 Asp Thr Asn ser "is Asn Asp Asp Ala Leu Leu Lys Asn Tyr 

SS Leu Leu Tyr Cys Phe Ar g Lys Asp Met Asp Lys Val Glu Thr Phe 

165 I/O 
L eu Arg He Val Gin Cys Arg Ser Val Glu Gly Ser Cys Gly Phe 



180 185 



<210> 16 

<211> 63 

<212> DNA 

<213> Lemna gibba 



; 4 a ract=== 9 , 9 9t g , 9 caa gg atcc gg , 9 ™»« B 
gcg 



8 



